Micro-canonical thermodynamics: Why does heat flow from hot to cold.

We show how to use a central limit approximation for additive co-cycles to describe non-equilibrium
and far from equilibrium thermodynamic behaviour. We consider first two weakly coupled Hamiltonian
dynamical systems initially at different micro-canonical temperatures. We describe a stochastic model
where the energy transfer between the two systems is considered as a random variable satisfying a
central limit approximation. We show that fluctuations in energy observables are linearly related to
the heat transfer (dissipation). As a result, on average, heat flows from hot to cold. We also consider
the far from equilibrium situation of a non-Hamiltonian thermostated system as in Evans et al.,
Phys. Rev. Lett. 71, 2401 (1993). Applying the same central limit approximation we re-derive their
relation for the violation of the 2nd law of thermodynamics. We note that time-reversal symmetry is
not used in our derivation.

PACS numbers: 05.20.Gg, 05.20.-y, 05.45.-a, 05.70.Ln, 02.40.Vh, 02.40.-k, 02.50.-r

Starting with the pioneering work of Boltzmann, equilibrium statistical mechanics has developed into a solid
corner-stone of theoretical physics. Using perturbative expansions, Kubo considered systems close to equilibrium and
obtained fluctuation-dissipation theorems, relating dissipation (linear response) in the system to fluctuations described
by decay of correlation functions. We refer to e.g. [?] for a nice review. Of great current interest, but less understood,
is the case of far from equilibrium statistical mechanics. Recent approaches to the subject were initiated by Evans et
al. (see also [5] for a review). One considers a thermostated system driven by external forces. The thermostat gives
rise to a phase space contraction which is interpreted as a production of entropy. Through numerical simulations,
Evans et al. made the interesting observation that the 2nd law of thermodynamics is broken in a systematic way.
As a model for this phenomenon the authors suggest that the dynamical behavior resembles that of the attractor of
an Anosov flow with an underlying time-reversal symmetry. This model has then been further developed by e.g.
Gallavotti and Cohen [?]. We also refer to Kurchan [6] and Lebowitz and Spohn [?] for a somewhat different stochastic
approach (but still using an inherent time-reversal symmetry). Bustamante et al. [1] (see also references therein)
give a review of recent physical experiments supporting the theoretical work.

Our aim below is to provide an elementary description of not only the above mentioned phenomena, but also the
time evolution of the energy transfer between two weakly coupled Hamiltonian systems. Neither thermostats nor
external forces are involved in this latter case. Our arguments are based upon a strong stochastic assumption, namely
that a central limit approximation applies to so-called additive co-cycles in the systems. In particular, we do not need
the presence of a time-reversal symmetry. As in [?] we obtain in both cases a universal law for the violation of the
2nd law of thermodynamics. Our approach is closely related to the study of the "structure functions" which were
used by Zwanzig [12].

TWO SYSTEMS AND A WEAK INTERACTION

Consider Hamiltonians $H_1$ and $H_2$ on phase spaces $\Omega_1$ and $\Omega_2$, respectively. We will study the energy flow between
the two systems that arises from introducing a weak coupling $U_{12}$ defined on the product space, $\Omega_1 \times \Omega_2$. When
the (micro-canonical) temperatures of the two systems are different one expects on heuristic grounds that energy
should flow from the 'hotter' to the 'colder' system. We wish to quantify this phenomenon within the micro-canonical
ensemble and without introducing external forces or heat-baths in the problem. Recall that the flow $\phi^t$ associated
to the total Hamiltonian function $H = H_1 + H_2 + U_{12}$ preserves the total energy as well as the product Liouville volume
$m(d\xi) = m_1(d\xi_1)\, m_2(d\xi_2)$, with $\xi = (\xi_1, \xi_2)$ being coordinates on $\Omega_1 \times \Omega_2$. In particular, also the micro-canonical
measure $\mu_E(d\xi) = \delta(H(\xi) - E)\, m(d\xi)$, having support on the energy surface $H(\xi) = E$, is invariant under the flow.
Our goal is to study the time evolution of e.g. the first system, $H_1$, under the global flow $\phi^t$ at some fixed total energy
$E$. We will adopt a stochastic point of view. The basic idea is that when the coupling is weak and each sub-system
(hopefully) is mixing fast enough on its proper energy surface then at every instant of time each system is close to
an equilibrium of that system. In spirit this is close to linear response theory. There, however, one usually applies
a fixed (small) perturbation and then lets the system evolve to a new equilibrium state close to the original. In our
context we consider a slow but steady evolution away from the original state. Instead of starting out at a particular
point in phase space we start out with an $H_1$-conditional ensemble at time $t = 0$:

\[ e^{-f(E_1)}\, \delta(H_1(\xi) - E_1)\, \mu_E(d\xi). \]

The normalizing factor,

\[ e^{f(E_1)} = \int \delta(H_1(\xi) - E_1)\, \mu_E(d\xi), \]

defines the $H_1$-conditional entropy, $f(E_1)$. Now, this conditional ensemble is, in general, not time-invariant when the
interaction is turned on. At time $t > 0$, the probability distribution of the values of $H_1(\phi^t(\xi))$ will be given by an
expression of the form:

\[ p_t(x \mid E_1) = \frac{\int \delta(H_1(\phi^t(\xi)) - x)\, \delta(H_1(\xi) - E_1)\, \mu_E(d\xi)}{\int \delta(H_1(\xi) - E_1)\, \mu_E(d\xi)}. \]

We assume for simplicity of notation that the distribution admits a density. More generally one could formulate the
relation in terms of measures without affecting the conclusions. By normalization, the kernel satisfies:

\[ \int p_t(x \mid E_1)\, dx = 1. \]

It verifies, however, another important identity: Multiplying by $e^{f(E_1)}$ and integrating with respect to $E_1$ we get

\[ \int p_t(x \mid E_1)\, e^{f(E_1)}\, dE_1 = \int \delta(H_1(\phi^t(\xi)) - x)\, \mu_E(d\xi) = \int \delta(H_1(\xi) - x)\, \mu_E(d\xi) = e^{f(x)}, \tag{1} \]

where we used the fact that $\mu_E(d\xi)$ is $\phi^t$-invariant. At present the above expressions are exact.

We come to the crucial approximation: Consider $X_t = H_1(\phi^t(\xi))$ as a random variable whose probability distribution
is given by $p_t(x \mid E_1)\, dx$. The mean drift is $m_t(E_1) = \mathbb{E}(X_t) - E_1 = \int (x - E_1)\, p_t(x \mid E_1)\, dx$ and the variance is
$\sigma_t^2(E_1) = \mathrm{Var}(X_t)$. Both drift and variance are functions of $t$ and $E_1$. When $t$ tends to zero, $p_t(x \mid E_1) \to \delta(x - E_1)$
and when $t$ tends to infinity (assuming global mixing) $p_t(x \mid E_1) \to e^{f(x)} \times \mathrm{const}$. The energy increment
$A^t(\xi) = H_1(\phi^t(\xi)) - H_1(\xi)$ is an 'additive co-cycle'. By this we mean that for all $s, t > 0$, $\xi \in \Omega_1 \times \Omega_2$:

\[ A^{t+s}(\xi) = A^t \circ \phi^s(\xi) + A^s(\xi). \tag{2} \]

When time-correlations decay fast enough, such additive co-cycles tend to have asymptotic properties in common
with sums of independent random variables. In particular, it may be within reason to assume that $A^t$ behaves like
a Gaussian variable (at least on certain time scales). This is the case e.g. when looking at smooth observables in
Anosov systems or in exponentially mixing Markov chains.
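The co-cycle property (2) is purely algebraic: it holds for any flow satisfying $\phi^{t+s} = \phi^t \circ \phi^s$. As a minimal sketch, the snippet below checks it numerically for a discrete-time flow. The model (two unit harmonic oscillators with a hypothetical coupling `eps * q1 * q2`, integrated by symplectic Euler steps) and all names and parameter values are assumptions chosen for illustration only. This toy system is not mixing, so it illustrates the identity and not the central limit behaviour.

```python
def step(state, dt=0.01, eps=0.05):
    """One symplectic Euler step for H = H1 + H2 + eps*q1*q2 (toy model, assumed)."""
    q1, p1, q2, p2 = state
    # Kick: update the momenta with the forces of the coupled system.
    p1 -= dt * (q1 + eps * q2)
    p2 -= dt * (q2 + eps * q1)
    # Drift: update the positions with the new momenta.
    q1 += dt * p1
    q2 += dt * p2
    return (q1, p1, q2, p2)

def flow(state, n):
    """Discrete-time flow phi^n: n successive steps."""
    for _ in range(n):
        state = step(state)
    return state

def H1(state):
    """Energy of the first sub-system."""
    q1, p1, _, _ = state
    return 0.5 * (p1 * p1 + q1 * q1)

def increment(state, n):
    """Energy increment A^n(xi) = H1(phi^n(xi)) - H1(xi)."""
    return H1(flow(state, n)) - H1(state)

xi = (1.0, 0.0, 0.3, 0.7)   # arbitrary initial point
s, t = 120, 250             # two numbers of steps
lhs = increment(xi, s + t)
rhs = increment(flow(xi, s), t) + increment(xi, s)
cocycle_defect = abs(lhs - rhs)  # vanishes up to floating point rounding
```

The defect is zero up to rounding because the same sequence of steps is applied on both sides; the stochastic content of the argument lies entirely in the distribution of $A^t$, not in this identity.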

Central Limit Approximation: Assume that there are two characteristic time-scales $\tau_{mix} \ll \tau_{eq}$ where $\tau_{mix}$ is a
'mixing'-time of the sub-systems and $\tau_{eq}$ is a time-scale for 'significant' changes in the energy of each sub-system.
When $\tau_{mix} \ll t \ll \tau_{eq}$ we may approximate $p_t(x \mid E_1)$ by the corresponding normal distribution:

\[ p_t(x \mid E_1) \approx \frac{1}{\sqrt{2\pi}\, \sigma_t} \exp\left( -\frac{(x - E_1 - m_t)^2}{2 \sigma_t^2} \right). \tag{3} \]

Weak Coupling Approximation: We need to be able to calculate derivatives of the relative entropy $f(x)$. This is easy
if we can neglect the interaction term. In this approximation $H(\xi) = H_1(\xi_1) + H_2(\xi_2)$ and

\[ e^{f(x)} = \int \delta(H_1(\xi_1) - x)\, \mu_E(d\xi) = \int \delta(H_1(\xi_1) - x)\, \delta(H_2(\xi_2) - (E - x))\, m_1(d\xi_1)\, m_2(d\xi_2) = e^{S_1(x) + S_2(E - x)}, \tag{4} \]

where $S_i(x) = \log \int \delta(H_i(\xi_i) - x)\, m_i(d\xi_i)$, $i = 1, 2$, are the $\mu$-canonical entropies of the two sub-systems. For each
sub-system, one may associate its $\mu$-canonical temperature, i.e. $\frac{1}{T_i} = \frac{dS_i}{dE_i}$, as well as its heat-capacity,
$\frac{1}{c_i} = \frac{dT_i}{dE_i} = -T_i^2 \frac{d^2 S_i}{dE_i^2}$. As we showed in earlier papers [10, 11], such quantities are computable within the
$\mu$-canonical ensemble provided each subsystem is ergodic. On the time-scale $\tau_{mix} \ll t \ll \tau_{eq}$, one may assign local micro-canonical
thermodynamic characteristics to each sub-system. In the weak coupling limit the global equilibrium of the combined
system is at the energy $E_1 = E_1^*$ for which $f'(E_1^*) = 0$ or $T_1(E_1^*) = T_2(E - E_1^*)$, i.e. the two micro-canonical
temperatures are equal. When heat-capacities are positive (i.e. $S_1$ and $S_2$ are strictly concave) the corresponding
energy is unique. But we do not need this for the present discussion.
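To make the equilibrium condition $f'(E_1^*) = 0$ concrete, here is a small sketch under an assumed form of the entropies, $S_i(x) = a_i \log x$ (an ideal-gas-like density of states; this choice and the constants `a1`, `a2` are hypothetical and do not come from the derivation above). In that case $1/T_i = a_i / x$, so the root can be compared with the closed form $E_1^* = E\, a_1/(a_1 + a_2)$.

```python
def inverse_temperature(a, energy):
    """dS/dE for the assumed entropy S(E) = a * log(E)."""
    return a / energy

def f_prime(x, E, a1, a2):
    """f'(x) = 1/T1(x) - 1/T2(E - x) in the weak coupling approximation."""
    return inverse_temperature(a1, x) - inverse_temperature(a2, E - x)

def equilibrium_energy(E, a1, a2, tol=1e-12):
    """Bisection for the root of f' on (0, E); f' is decreasing for these entropies."""
    lo, hi = 1e-9 * E, (1.0 - 1e-9) * E
    while hi - lo > tol * E:
        mid = 0.5 * (lo + hi)
        if f_prime(mid, E, a1, a2) > 0:
            lo = mid   # system 1 is still colder than system 2: root lies above
        else:
            hi = mid
    return 0.5 * (lo + hi)

E, a1, a2 = 10.0, 3.0, 7.0          # hypothetical total energy and entropy constants
x_star = equilibrium_energy(E, a1, a2)
closed_form = E * a1 / (a1 + a2)    # equal-temperature energy for this toy model
```

Both entropies are strictly concave here, so the root is unique, in line with the remark on positive heat-capacities.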

Recall that the distribution function $p_t$ for $X_t$ leaves the micro-canonical ensemble invariant:

\[ \int p_t(x \mid E_1)\, e^{f(E_1)}\, dE_1 = e^{f(x)}. \]

When $t$ is not too large it is reasonable to expect the variance of $X_t$ to be small compared to the inverse of the curvature
of $f$. We may then replace $f(E_1)$ by its first order Taylor expansion around $x$: $f(E_1) = f(x) + \lambda (E_1 - x) + o(E_1 - x)$
with $\lambda = f'(x) = \frac{1}{T_1(x)} - \frac{1}{T_2(E - x)}$. We insert this into our central limit expression (3), neglecting the dependence
of $m_t$ and $\sigma_t$ on $E_1$ to this order, and get for the integral:

\[ \int \frac{1}{\sqrt{2\pi}\, \sigma_t} \exp\left( -\frac{(x - E_1 - m_t)^2}{2 \sigma_t^2} + \lambda (E_1 - x) \right) dE_1 = \exp\left( \lambda^2 \sigma_t^2 / 2 - \lambda m_t \right) = 1. \]

This implies that either $\lambda = 0$ (which corresponds to the systems having identical temperatures, i.e. they are in
thermodynamic equilibrium) or, more interestingly, when $\lambda$ is non-zero we get the relation $m_t = \frac{1}{2} \lambda \sigma_t^2$. We have
obtained the following:



Fluctuation Dissipation Relation. Under the Central Limit Approximation we have for $\tau_{mix} \ll t \ll \tau_{eq}$:

\[ \log \frac{p_t(E_1 + u \mid E_1)}{p_t(E_1 - u \mid E_1)} = \frac{2 m_t}{\sigma_t^2}\, u = \left( \frac{1}{T_1(E_1)} - \frac{1}{T_2(E - E_1)} \right) u. \tag{5} \]

The relation between $m_t$ and $\sigma_t^2$ states that the mean drift in energy (dissipation) of each sub-system is proportional
to the fluctuations in the sub-system, with a constant of proportionality given by half the difference of the inverse
temperatures of the two systems. Since fluctuations are non-negative, on average energy flows from 'hot' to 'cold': if
system 1 is the hotter one then $1/T_1 < 1/T_2$ and $m_t < 0$, so system 1 loses energy. The expression for the logarithmic
ratio is obtained by combining expression (3) and the fluctuation dissipation relation (5). It expresses the relative
probability of a violation of the 2nd law of thermodynamics at certain time and energy scales.
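The Gaussian computation behind the relation can be checked numerically. The sketch below takes the normal kernel of the Central Limit Approximation with a drift chosen as $m_t = \lambda \sigma_t^2 / 2$, treats $m_t$ and $\sigma_t$ as independent of $E_1$ (an assumption), and verifies by a midpoint rule that the invariance integral equals 1 and that the logarithmic ratio equals $\lambda u$. The numerical values of `lam`, `sigma`, `x` and `u` are arbitrary illustrations.

```python
import math

def kernel(x, E1, m, sigma):
    """Normal approximation to p_t(x | E1) with drift m and standard deviation sigma."""
    z = (x - E1 - m) / sigma
    return math.exp(-0.5 * z * z) / (math.sqrt(2.0 * math.pi) * sigma)

def invariance_integral(x, lam, m, sigma, half_width=12.0, n=20000):
    """Midpoint rule for the integral over E1 of p_t(x | E1) * exp(lam * (E1 - x))."""
    a = x - half_width * sigma
    h = 2.0 * half_width * sigma / n
    total = 0.0
    for k in range(n):
        E1 = a + (k + 0.5) * h
        total += kernel(x, E1, m, sigma) * math.exp(lam * (E1 - x))
    return total * h

lam, sigma = 0.8, 0.5            # assumed values of f'(x) and of sigma_t
m = 0.5 * lam * sigma ** 2       # drift given by the fluctuation dissipation relation
integral = invariance_integral(x=2.0, lam=lam, m=m, sigma=sigma)

E1, u = 2.0, 0.3
log_ratio = math.log(kernel(E1 + u, E1, m, sigma) / kernel(E1 - u, E1, m, sigma))
```

Choosing any other drift, for instance `m = 0`, makes the integral differ from 1, which is how the invariance of the micro-canonical ensemble pins down the ratio of drift to variance.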

VALIDITY AND COMPUTABILITY 

Estimates both for $p_t(x \mid E_1)$ and temperatures may be obtained through numerical simulations, thus allowing for
a verification of our Central Limit Approximation (CLA) as well as the Fluctuation Dissipation Relation (FDR).
Assuming ergodicity it suffices for $p_t(x \mid E_1)$ to run the system without interactions to get initial points representing
the $H_1$-conditional ensemble, then turn on the (weak) interaction and run the ensemble to give estimates for this
transition probability. For the temperatures of the sub-systems, we may e.g. use [10, 11]: If $X_i$, $i = 1, 2$, are vector
fields on $\Omega_i$ for which $dH_i(X_i) = 1$ then, without interactions, we have $1/T_i = \langle \mathrm{div}_{m_i}(X_i) \mid E_i \rangle$, the ergodic average
of the observable $\mathrm{div}_{m_i}(X_i)$ at energy $E_i$ for each sub-system. Under weak interactions but assuming that sub-system
energies vary slowly compared to the ergodic averaging, such expressions are good candidates for temperature
observables for the two sub-systems.
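As a sketch of how such a verification might be organised, the snippet below estimates the ratio $2 m_t / \sigma_t^2$ from a list of sampled values of $X_t$, to be compared with the difference of inverse temperatures. No dynamical system is simulated here: the samples are synthetic Gaussian numbers that obey the relation by construction, standing in for simulation output. The function name `fdr_slope` and all parameter values are hypothetical.

```python
import math
import random

def fdr_slope(samples, E1):
    """Estimate 2 * m_t / sigma_t**2 from values of X_t for an ensemble started at energy E1."""
    n = len(samples)
    mean = math.fsum(samples) / n
    variance = math.fsum((s - mean) ** 2 for s in samples) / n
    drift = mean - E1
    return 2.0 * drift / variance

# Synthetic stand-in for simulation data (assumption): Gaussian with drift lam * sigma**2 / 2.
rng = random.Random(0)
E1, lam, sigma = 5.0, 0.4, 1.0
m = 0.5 * lam * sigma ** 2
samples = [rng.gauss(E1 + m, sigma) for _ in range(200_000)]

slope = fdr_slope(samples, E1)   # should be close to lam, up to sampling error
```

With real simulation data, `lam` would instead be obtained from independent temperature estimates of the two sub-systems, and a histogram of the samples would serve to test the normal approximation itself.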

For numerical reasons, interactions should be small but not too small. The interaction constitutes background
fluctuations of order $\omega = O(U_{12})$. If this is too large, it is unlikely that one may observe the FD-relation. On the
other hand, since energy exchange is a second order phenomenon (see e.g. [12]), we have to wait a time of order $1/\omega$ to get
an effective energy transfer exceeding the background noise. Numerical errors could then create problems.

The expression (3) for $p_t(x \mid E_1)$ could be a good approximation even on time scales smaller than the mixing time of
sub-systems. The reason is that for a large system space mixing may give rise to a good central limit approximation
for the observable $H_1$ even without time-mixing. The FDR may, however, fail in that case. As a (non-generic, though)
example suppose that system 2 itself is a sum of two sub-systems with a small interaction: $H_2 = H_a + V_{ab} + H_b$.
We now add an interaction of the form $U_{12} = U_{1a} - V_{ab}$ to $H_1 + H_2$. This has the effect of coupling $H_1$ and $H_a$
but decoupling their sum from $H_b$. As a result there will be fluctuations in $H_1$ but the predicted mean drift will in
general be wrong, there will be no global equilibration and the fluctuation dissipation relation as stated should fail.



A THERMOSTATED NON-EQUILIBRIUM SYSTEM 

We now consider a situation as described in [2] in which the authors consider a Hamiltonian $H$ on a phase space $\Omega$
and subjected to a thermostat. One associates to this a non-Hamiltonian vector field $X_H$ and its flow $\phi^t$. For details
of the construction see [?]. For our purposes, the essential properties may be summarized as follows: The
flow $\phi^t$ preserves $H$ but not the Liouville volume $m$. We write $\mu_E = \delta(H - E)\, m$ for the associated area form
on the energy surface. Neither $m$ nor $\mu_E$ is invariant under $X_H$. The Jacobian $\mathrm{Jac}^t(\xi) = m(\phi^t d\xi)/m(d\xi)$ describes
the volume transformation along the flow. Because of $H$-invariance we have $\mathrm{Jac}^t(\xi) = \mu_E(\phi^t d\xi)/\mu_E(d\xi)$ as well, i.e.
it is the same Jacobian for the surface area and for the bulk volume. To see this one may e.g. use differential forms
and take the Lie derivative of $\mu_E$: $L_{X_H}(\delta(H - E)\, m) = \delta(H - E)\, L_{X_H} m = \delta(H - E)\, \mathrm{div}_m(X_H)\, m$, where the first
equality is due to $L_{X_H} H = dH(X_H) = 0$. It shows that infinitesimally $m$ and $\mu_E$ have the same Jacobian so this is also
the case for the flow.

One wants to observe the phase space contraction rate, manifested by the above Jacobian. The Jacobian is
multiplicative and not additive but taking a logarithm, we get an additive co-cycle as before. So, our observable will
be $A^t = \log \mathrm{Jac}^t(\xi)$, which verifies $A^{t+s} = A^t \circ \phi^s + A^s$. It is a computable, i.e. observable quantity in this context.
The object of interest is then the distribution function for $A^t$ which we consider in the $\mu_E$-ensemble:

\[ p_t(a) = \frac{\int \delta(A^t(\xi) - a)\, \mu_E(d\xi)}{\int \mu_E(d\xi)}. \]

Since $\phi^t : \Omega \to \Omega$ is a diffeomorphism we get by change of variables:

\[ \int_\Omega \mu_E(d\xi) = \int_{\phi^t \Omega} \mu_E(d\xi) = \int_\Omega \mathrm{Jac}^t(\xi)\, \mu_E(d\xi). \]

Inserting the distribution function for $A^t$ we get the (exact) relation:

\[ \int e^{a}\, p_t(a)\, da = \frac{\int \mathrm{Jac}^t(\xi)\, \mu_E(d\xi)}{\int \mu_E(d\xi)} = 1. \]

This is a constraint equation for the distribution function $p_t$. As $A^t$ is an additive co-cycle we again make the strong
stochastic assumption that we may approximate $p_t$ by a normal distribution, $p_t \sim \mathcal{N}(m_t, \sigma_t^2)$. Doing so and inserting
in the above constraint equation yields $\exp(m_t + \sigma_t^2/2) = 1$ or $m_t = -\sigma_t^2/2$. This is the FD relation of e.g. [2].
As in the cited papers, one has the symmetry-relation

\[ p_t(a)/p_t(-a) = e^{-a}. \]
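For a normal distribution both statements can be verified directly. The following sketch fixes an arbitrary spread `sigma` (an assumed value), sets the mean to $-\sigma^2/2$, and checks by a midpoint rule that the mean of $e^{A^t}$ equals 1 and that $\log\left(p_t(a)/p_t(-a)\right) = 2 m_t a / \sigma_t^2 = -a$. It tests only the Gaussian algebra, not the validity of the normal approximation for any particular thermostated system.

```python
import math

def normal_pdf(a, m, sigma):
    """Density of the normal distribution N(m, sigma**2) at a."""
    z = (a - m) / sigma
    return math.exp(-0.5 * z * z) / (math.sqrt(2.0 * math.pi) * sigma)

def mean_jacobian(m, sigma, half_width=12.0, n=20000):
    """Midpoint rule for the integral of exp(a) * p_t(a) da, i.e. the mean of Jac^t."""
    lo = m - half_width * sigma
    h = 2.0 * half_width * sigma / n
    total = 0.0
    for k in range(n):
        a = lo + (k + 0.5) * h
        total += math.exp(a) * normal_pdf(a, m, sigma)
    return total * h

sigma = 1.5                  # assumed spread of the log-Jacobian A^t
m = -0.5 * sigma ** 2        # mean fixed by the constraint equation
constraint = mean_jacobian(m, sigma)

a = 0.7
log_ratio = math.log(normal_pdf(a, m, sigma) / normal_pdf(-a, m, sigma))
```

The negative mean reflects average phase space contraction; positive values of $A^t$, which correspond to violations of the 2nd law, are exponentially less likely than negative values of the same size.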

There is an important approximation taking place when comparing the above derivation and the numerical simulations.
In our derivation the distribution of the observable $A^t$ is with respect to the initial distribution $\mu_E = \delta(H - E)\, m$
whereas the numerical computations are done for a (hopefully) stationary state of the system. This distinction also
makes a subtle difference between the points of view of [?] and [?]. For an Anosov system, the distinction is not important.
In both ensembles a central limit approximation holds and with the same constants. Working in the stationary state,
however, is numerically more stable as it eliminates the contribution of transients which can be quite large. For more
realistic models, it would be of interest to compare numerically the two ensemble distributions.

We note that time-reversal symmetry is not needed in the above derivation. Again it would be interesting to
compare with numerical simulations for a system without time-reversal symmetry. Mittag et al. presented such a
system in [?] for which the distribution function $p_t$ was quite far from Gaussian and the FD-relation fails. This,
however, does not contradict the above derivation since in their case the external field was changed during the time
span of the experiment. $A^t$ is then not an additive co-cycle so transient behaviour becomes significant. We also
note that our derivation does not make use of the underlying symplectic structure of phase space. So in principle our
derivation makes sense for any dynamical system on a compact manifold (here, $\{H = E\}$) that converges fast enough
towards a natural measure and for which correlations decay fast enough.